SEQUENCE LISTING 



110> Tchaga, Grigory S. 
Jokhadze, George 

<120> Metal Ion Affinity Tags and Methods for 
Using the Same 



<130> CLON-056CIP 

<140> US 09/858,332 
<141> 2001-05-15 

<150> 09/404,017 
<151> 1999-09-23 

<150> 60/101,867 
<151> 1998-09-25 

<160> 33 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 1 

His Leu lie His Asn Val His Lys Glu Glu His Ala His Ala His Asn 
1 5 10 15 



<210> 2 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 2 

His Asp Asp His Asp Asp His Asp Asp His Asp Asp His Asp Asp His 

15 10 15 

Asp Asp 



<210> 3 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 



1 



<223> affinity peptide 



<400> 3 

His Glu Glu His Glu Glu His Glu Glu His Glu Glu His Glu Glu His 

15 10 15 

Glu Glu 



<210> 4 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<40O> 4 

His Asp Glu His Asp Glu His Glu Asn His Glu Asn His Glu Asp His 

1 5 10 15 

Glu Asp 



<210> 5 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 5 

His Glu Asp His Glu Asp His Glu Asp His Glu Asp His Glu Asp His 

1 5 '10 15 

Glu Asp 



<210> 6 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 6 

Asp Asp Asp Asp Lys 
1 5 



<210> 7 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> factor Xa cleavage site 



2 



<4 00> 7 

lie Glu Gly Arg 
1" 



<210> 8 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> thrombin cleavage site 
<400> 8 

Leu Val Pro Arg Gly Ser 
1 5 



<210> 9 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> renin cleavage site 
<400> 9 

His Pro Phe His Leu Val lie His 
1 5 



<210> 10 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an immunological tag 
<400> 10 

Cys" Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
15 10 



<210> 11 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an immunological tag 
<400> 11 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 12 



3 



<211> 11- 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an immunological tag 
<400> 12 

Cys Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
15 10 



<210> 13 
<211> 3426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA sequence of vector containing cDNA of 
recombinant enterokinase 



<400> 13 

gacgaaaggg 

cttagacgtc 

tctaaataca 

aatattgaaa 

ttgcggcatt 

ctgaagatca 

tccttgagag 

tatgtggcgc 

actattctca 

gcatgacagt 

acttacttct 

gggatcatgt 

acgagcgtga 

gcgaactact 

ttgcaggacc 

gagccggtga 

cccgtatcgt 

agatcgctga 

catatatact 

tcctttttga 

cagaccccgt 

gctgcttgca 

taccaactct 

ttctagtgta 

tcgctctgct 

ggttggactc 

cgtgcacaca 

agctatgaga 

gcagggtcgg 

atagtcctgt 

gggggcggag 

gctggccttt 

ttaccgcctt 

cagtgagcga 

cgattcatta 

acgcaattaa 

cggctcgtat 



cctcgtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tgagtgagct 
ggaagcggaa 
atgcagctgg 
tgtgagttag 
gttgtgtgga 



cgcctatttt 
tttcggggaa 
tatccgctca 
atgagtattc 
gtttttgctc 
cgagtgggtt 
gaagaacgtt 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
tcggcccttc 
cgcggtatca 
acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
gataccgctc 
gagcgcccaa 
cacgacaggt 
ctcactcatt 
attgtgagcg 



tataggttaa 
atgtgcgcgg 
tgagacaata 
aacatttccg 
acccagaaac 
acatcgaact 
ttccaatgat 
ccgggcaaga 
caccagtcac 
ccataaccat 
aggagctaac 
aaccggagct 
tggcaacaac 
aattaataga 
cggctggctg 
ttgcagcact 
gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
gccgcagccg 
tacgcaaacc 
ttcccgactg 
aggcacccca 
gataacaatt 



tgtcatgata 
aacccctatt 
accctgataa 
tgtcgccctt 
gctggtgaaa 
ggatctcaac 
gagcactttt 
gcaactcggt 
agaaaagcat 
gagtgataac 
cgcttttttg 
gaatgaagcc 
gttgcgcaaa 
ctggatggag 
gtttattgct 
ggggccagat 
tatggatgaa 
actgtcagac 
taaaaggatc 
gttttcgttc 
tttttttctg 
ttgtttgccg 
gcagatacca 
tgtagcaccg 
cgataagtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 
tgattctgtg 
aacgaccgag 
gcctctcccc 
gaaagcgggc 
ggctttacac 
tcacacagga 



ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagttctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagggt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
gataaccgta 
cgcagcgagt 
gcgcgttggc 
agtgagcgca 
tttatgcttc 
aacagctatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 



4 



accatgatta 
gctcatgccc 
ccttgggtcg 
agggattggc 
tggaaagcag 
aggttgattg 
attgccatga 
ttaccagaag 
gcacttatat 
tcaaatgaga 
gcaggctatg 
tgccaagaaa 
ctgcctaatc 
tttctacatg 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 
catatggtgc 
cccgccaaca 
acaagctgtg 
acgcgc 



cgccaagctt 
acaacaagat 
ttgctctgta 
tggtgtcggc 
tgctaggcct 
accaaattgt 
tgcatcttga 
aaaatcaagt 
atcaaggttc 
aatgtcaaca 
aagcaggagg 
acaacagatg 
gcccaggggt 
agctcgtaat 
cctggcgtta 
agcgaagagg 
cgcctgatgc 
actctcagta 
cccgctgacg 
accgtctccg 



gaaggatcat 
cgatattgtc 
tttcgacgat 
cgcccactgc 
gcatatggca 
cataaaccca 
aatgaaagtg 
ttttccccca 
tactgcagac 
acagatgcca 
ggtagattct 
gctcctggct 
gtatgcccgg 
tagctgagaa 
cccaacttaa 
cccgcaccga 
ggtattttct 
caatctgctc 
cgccctgacg 
ggagctgcat 



ctcatccaca 
ggaggaagtg 
caacaggtct 
gtgtacggga 
tcaaatctga 
cactacaata 
aactacacag 
ggaagaattt 
gtactgcaag 
gaatataaca 
tgtcaggggg 
ggcgtgacgt 
gtcccaaggt 
ttcactggcc 
tcgccttgca 
tcgcccttcc 
ccttacgcat 
tgatgccgca 
ggcttgtctg 
gtgtcagagg 



atgtccacaa 
actccagaga 
gcggagcttc 
gaaatatgga 
cttctcctca 
aacggagaaa 
attatataca 
gttctattgc 
aagctgacgt 
ttacggaaaa 
attcaggcgg 
catttggata 
tcacagagtg 
gtcgttttac 
gcacatcccc 
caacagttgc 
ctgtgcggta 
tagttaagcc 
ctcccggcat 
ttttcaccgt 



agaggagcac 
aggagcctgg 
tctggtgagc 
gccgtctaag 
gatagaaact 
gaacaatgac 
gcctatttgt 
tggctggggg 
tccccttcta 
tatggtgtgt 
accactcatg 
tcaatgtgca 
gatacaaagt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 
tttcacaccg 
agccccgaca 
ccgcttacag 
catcaccgaa 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3426 



<210> 14 
<211> 269 . 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> protein sequence of vector containing cDNA of 
recombinant enterokinase 



<400> 14 



Met 


Thr 


Met 


He 


Thr 


Pro 


Ser 


Leu 


Lys 


Asp 


His 


Leu 


He 


His 


Asn 


Val 


1 








5 










10 










15 




His 


Lys 


Glu 


Glu 


His 


Ala 


His 


Ala 


His 


Asn 


Lys 


He 


Asp 


He 


Val 


Gly 








20 










25 










30 






Gly 


Ser 


Asp 


Ser 


Arg 


Glu 


Gly 


Ala 


Trp 


Pro 


Trp 


Val 


Val 


Ala 


Leu 


Tyr 






35 










40 










45 








Phe 


Asp 


Asp 


Gin 


Gin 


Val 


Cys 


Gly 


Ala 


Ser 


Leu 


Val 


Ser 


Arg 


Asp 


Trp 




50 










55 










60 










Leu 


Val 


Ser 


Ala 


Ala 


His 


Cys 


Val 


Tyr 


Gly 


Arg 


Asn 


Met 


Glu 


Pro 


Ser 


65 










70 










75 










80- 


Lys 


Trp 


Lys 


Ala 


Val 


Leu 


Gly 


Leu 


His 


Met 


Ala 


Ser 


Asn 


Leu 


Thr 


Ser 










85 










90 










95 




Pro 


Gin 


He 


Glu 


Thr 


Arg 


Leu 


He 


Asp 


Gin 


He 


Val 


He 


Asn 


Pro 


His 








100 










105 










110 






Tyr 


Asn 


Lys 


Arg 


Arg 


Lys 


Asn 


Asn 


Asp 


He 


Ala 


Met 


Met 


His 


Leu 


Glu 






115 










120 










125 








Met 


Lys 


Val 


Asn 


Tyr 


Thr 


Asp 


Tyr 


He 


Gin 


Pro 


He 


Cys 


Leu 


Pro 


Glu 




130 










135 










140 










Glu 


Asn 


Gin 


Val 


Phe 


Pro 


Pro 


Gly 


Arg 


He 


Cys 


Ser 


He 


Ala 


Gly 


Trp 


145 










150 










155 










160 


Gly Ala 


Leu 


He 


Tyr 


Gin 


Gly 


Ser 


Thr 


Ala 


Asp 


Val 


Leu 


Gin 


Glu 


Ala 










165 










170 










175 




Asp 


Val 


Pro 


Leu 


Leu 


Ser 


Asn 


Glu 


Lys 


Cys 


Gin 


Gin 


Gin 


Met 


Pro 


Glu 








180 










185 










190 






Tyr 


Asn 


He 


Thr 


Glu 


Asn 


Met 


Val 


Cys 


Ala 


Gly 


Tyr 


Glu 


Ala 


Gly 


Gly 






195 










200 










205 








Val 


Asp 


Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


Met 


Cys 


Gin 


Glu 
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210 • 215 
Asn Asn Arg Trp Leu Leu Ala Gly 
225* 230 
Ala Leu Pro Asn Arg Pro Gly Val 
245 

Glu Trp lie Gin Ser Phe Leu His 
260 



220 

Val Thr Ser Phe Gly Tyr Gin Cys 
235 240 
Tyr Ala Arg Val Pro Arg Phe Thr 

250 255 
Glu Leu Val lie Ser 
265 



<210> 15 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of- the affinity 
purification site 

<400> 15 

His Asn His Asn His Asn His Asn His Asn His Asn 
1 5 .10 



<210> 16 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

. <400> 16 

catctcatcc acaatgtcca caaagaggag cacgctcatg cccacaac 48 

<210> 17 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 17 

cataaccata accataacca taaccataac cataac 36 

<210> 18 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 18 

catgatgatc atgatgatca tgatgatcat gatgatcatg atgatcatga tgat 54 



6 



<210> 19- 
<21\> 54 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 19 

catgaggagc atgaggagca tgaggagcat gaggagcatg aggagcatga ggag 54 



<210> 20 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 20 

catgatgagc atgatgagca tgagaaccat gagaaccatg aggatcatga ggat 54 

<210> 21 
<211> 9 
<212> PRT 
<213> Human 

<220> 

<221> VARIANT 
<222> 9 

<223> Xaa at position 9 is an amino acid with an 
aliphatic or amide side chain. 

<220> 

<221> VARIANT 
<222> 2 

<223> Xaa at position 2 is an amino acid with an 
aliphatic or amide side chain 



<220> 

<221> VARIANT 
<222> 3 

<223> Xaa at position 3 is an amino acid with an 
aliphatic or amide side chain. 



<220> 

<221> VARIANT 
<222> 5 

<223> Xaa at position 5 is an amino acid with a basic 
side chain ^(except HIS) or an acidic side chain. 

<220> 

<221> VARIANT 
<222> 6 

<223> Xaa at position 6 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain. 



7 



<220> 

<22\> VARIANT 
<222> 7 

<223> Xaa at position 7 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain 

<400> 21 

His Xaa Xaa His. Xaa Xaa Xaa His Xaa 
1 5 



<210> 22 
<211> 6 
<212> PRT 
<213> Human 

<220> 

<400> 22 

His Arg His Arg His Arg 
1 5 



<210> 23 
<211> 9 
<212> PRT 
<213> Human 

<220> 

<221> VARIANT 
<222> 2 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 3 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 5 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 6 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 8 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 9 

<223> Xaa = an amino acid having an acidic side chain 
<400> 23 



8 



His Xaa Xaa His Xaa Xaa His Xaa Xaa 
1 . 5 



<210> 24 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 24 

Leu Pro Pro Leu Ser Glu Leu lie Pro Leu Ala Ala Ala Glu Arg Pro 

1.5 10 15 

Ser Ala Ala Ser Gin 
20 



<210> 25 
<211> 38 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 25 

Ala Arg Lys Arg Lys Ser Ala Gin Tyr Ala Asn Arg Leu Ser Pro Arg 

15 10 15 

Val Gly Arg Phe He Asn Ala Ala Gly Thr Thr Gly Phe Pro Thr Gly 

20 25 30 

Lys Arg Ala Val Ser Ala 
35 



<210> 26 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 26 

Glu Phe Thr Gly Arg Arg Phe Thr Thr Ser 
1 5 10 



<210> 27 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 27 
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His Asn His Asn His Asn 
1 . 5 



<210> 28 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 28 

His Asn His Asn His. Asn His Asn 
1 5 



<210> 29 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 29 

His Asn His Asn His Asn His Asn His Asn 
1 5 10 



<210> 30 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 30 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn 
1 5 10 



<210> 31 
<211> 16 
<212> PRT 

<213> Artificial Sequence . 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 31 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn His Asn 
1 5 10 15 
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<210> 32 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 32 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn His Asn 

1 5 10 15 

His Asn 



<210> 33 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 33 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn His Asn 

1 5 10 15 

His Asn His Asn 
20 
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